Speedup of Particle Transport Problems with a Beowulf Cluster
نویسندگان
چکیده
The MCNP code is a general Monte Carlo N-Particle Transport program that is widely used in health physics, medical physics and nuclear engineering for problems involving neutron, photon and electron transport. However, due to the stochastic nature of the algorithms employed to solve the Boltzmann transport equation, MCNP generally exhibits a slow rate of convergence. In fact, engineers and scientists can quickly identify intractable versions of their most challenging and CPU-intensive problems. For example, despite the latest advancements in personal computers (PCs) and quantum leaps in their computational capabilities, an ordinary electron transport problem could require up to several CPU-days or even CPU-weeks on a typical desktop PC of today. One common contemporary approach to help address these performance limitations is by taking advantage of parallel processing. In fact, the very nature of the Monte Carlo approach embedded within MCNP is inherently parallel because, at least in principle, every particle history can potentially be tracked individually in an independent processor. In practice, however, there are many issues that must be confronted to achieve a reasonable level of parallelization. First, of course, a suitable parallel computing platform is required. Next, the computer program itself should exploit parallelism from within by combining such tools as Fortran-90 and PVM, for example. This article describes the installation and performance testing of the latest release of MCNP, Version 5 (MCNP5), compiled with PGI Fortran-90 and with PVM on a recently assembled 22-node Beowulf cluster that is now a dedicated platform for the faculty and students of the University of Cincinnati’s Nuclear and Radiological Engineering (UCNRE) Program. The performance of a neutron transport problem and that of a more challenging gamma-electron (coupled) problem are both highlighted. The results show that the PVM-compiled MCNP5 version with 20 tasks can execute roughly 12 times faster than the sequential version of the tested neutron transport problem, whereas another increasingly challenging gamma-electron (coupled) dose problem executed nearly 18 times faster than its serial counterpart, this performance on 20 processors is an impressive result relative to a linear (ideal) speedup. Kew words: Beowulf cluster, speedup, particle transport problems
منابع مشابه
Development of a Parallel Electron and Photon Transport Code (PMCEPT) I: Method and Absorbed Dose Computation in Water
The Monte Carlo (MC) method for high-energy photon and charged particle transport is the most accurate for dose calculations in radiotherapy. However, the large amount of computing time required by general purpose MC codes has prevented their use for routine dose distribution calculations for customized radiation treatment planning. One of the best ways to provide an accurate dose distribution ...
متن کاملDistributed BEAGLE: An Environment for Parallel and Distributed Evolutionary Computations
Evolutionary computation is a promising artificial intelligence field involving the simulation of natural evolution to solve problems. Given its implicit parallelism and high computational requirements, evolutionary computation is the perfect candidate for high performance parallel computers. This paper presents Distributed BEAGLE, a new master-slave architecture for parallel and distributed ev...
متن کاملDistributed Processing of High-speed Stereo Particle Image Velocimetry Data with a Low Power Beowulf Cluster
In this paper, we present our design of a portable cluster supercomputer created specifically for processing PIV image data. To make the system portable for a laboratory environment, the cluster computer is required to run on minimal power and be light weight. The final cluster design was based on the Intel R ©QuadCore Xeon R ©X3210 low-power processors. This hardware setup allows for a total s...
متن کاملBeowulf Cluster Performance for a Dynamic Programming Solution of a Two-class Queueing Network
We describe an experiment designed to determine if a dedicated parallel cluster is effective in solving certain queueing network problems using dynamic programming. We show that the same domain decomposition method that is commonly used for solving elliptic partial differential equations with finite differences is effective for this class of problems and yields roughly the same parallel efficie...
متن کاملElectromagnetics Computations Using the MPI Parallel Implementation of the Steepest Descent Fast Multipole Method (SDFMM)
The computational solution of large-scale linear systems of equations necessitates the use of fast algorithms but is also greatly enhanced by employing parallelization techniques. The objective of this work is to demonstrate the speedup achieved by the MPI (Message Passing Interface) parallel implementation of the Steepest Descent Fast Multipole Method (SDFMM). Although this algorithm has alrea...
متن کامل